Modeling RNA degradation for RNA-Seq with applications.

نویسندگان

  • Lin Wan
  • Xiting Yan
  • Ting Chen
  • Fengzhu Sun
چکیده

RNA-Seq is widely used in biological and biomedical studies. Methods for the estimation of the transcript's abundance using RNA-Seq data have been intensively studied, many of which are based on the assumption that the short-reads of RNA-Seq are uniformly distributed along the transcripts. However, the short-reads are found to be nonuniformly distributed along the transcripts, which can greatly reduce the accuracies of these methods based on the uniform assumption. Several methods are developed to adjust the biases induced by this nonuniformity, utilizing the short-read's empirical distribution in transcript. As an alternative, we found that RNA degradation plays a major role in the formation of the short-read's nonuniform distribution and thus developed a new approach that quantifies the short-read's nonuniform distribution by precisely modeling RNA degradation. Our model of RNA degradation fits RNA-Seq data quite well, and based on this model, a new statistical method was further developed to estimate transcript expression level, as well as the RNA degradation rate, for individual genes and their isoforms. We showed that our method can improve the accuracy of transcript isoform expression estimation. The RNA degradation rate of individual transcript we estimated is consistent across samples and/or experiments/platforms. In addition, the RNA degradation rate from our model is independent of the RNA length, consistent with previous studies on RNA decay rate.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Herpes Simplex Virus Virion Host Shutoff Gene- a New Suicide Gene- on Tumor Cells

Background: The herpes simplex virus (HSV) UL41 gene product, virion host shutoff (Vhs) protein, mediates the rapid degradation of both viral and cellular mRNA. This ability suggests that Vhs protein can be used as a suicide gene in cancer gene therapy applications. The recent reports have shown that the degradation of cellular mRNA during herpes simplex infection is selective. RNA containing A...

متن کامل

Determination of in vivo RNA kinetics using RATE-seq.

The abundance of a transcript is determined by its rate of synthesis and its rate of degradation; however, global methods for quantifying RNA abundance cannot distinguish variation in these two processes. Here, we introduce RNA approach to equilibrium sequencing (RATE-seq), which uses in vivo metabolic labeling of RNA and approach to equilibrium kinetics, to determine absolute RNA degradation a...

متن کامل

Investigating the Function of Predicted Proteins from RNA-Seq Data in Holstein and Cholistani Cattle Breeds

This study was performed to determine the digital expression profile of different genes expressed in Holstein and Cholistani breeds as well as to evaluate the performance of predicted proteins derived from differentially expressed genes between these two breeds using RNA-Seq data. For this purpose, the whole mRNA sequence for a blood sample of American Holstein and Pakistani Cholistani cattle p...

متن کامل

مطالعه بیان ژن افتراقی زنبور عسل ملکه، نر و کارگر با استفاده از داده‌های RNA-seq

این پژوهش با هدف مطالعه پروفایل بیان ژن و تعیین ژن‌های شاخص در تمایز و تکامل ملکه، نر و کارگر با مقایسه تفریقی آن‌ها در سن 5 لاروی یا همان سن تمایز انجام شد. لذا ترانسکریپتوم (توالی کل mRNA) 15 نمونه از زنبور عسل نژاد ایتالیایی (A. m. ligustica) شامل 5 زنبور نر، 5 کارگر و 5 ملکه از طریق همردیفی و مکان­یابی خوانش­های RNA-Seq بر روی ژنوم مرجع زنبور عسل نسخه Amel_4.5 مرتب شد. سپس آنالیز بیان افترا...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biostatistics

دوره 13 4  شماره 

صفحات  -

تاریخ انتشار 2012